A 3-factor approximation algorithm for a Minimum Acyclic Agreement Forest on k rooted, binary phylogenetic trees
نویسندگان
چکیده
Molecular phylogenetics is a well-established field of research in biology wherein phylogenetic trees are analyzed to obtain insights into the evolutionary histories of organisms. Phylogenetic trees are leaf-labelled trees, where the leaves correspond to extant species (taxa), and the internal vertices represent ancestral species. The evolutionary history of a set of species can be explained by more than one phylogenetic tree, giving rise to the problem of comparing phylogenetic trees for similarity. Various distance metrics, like the subtree prune-and-regraft (SPR), tree bisection reconnection (TBR) and nearest neighbour interchange (NNI) have been proposed to capture this similarity. The distance between two phylogenetic trees can also be measured by the size of a Maximum Agreement Forest (MAF) on these trees, as it has been shown that the rooted subtree prune-and-regraft distance is 1 less than the size of a MAF. Thus, the smaller this size, the greater is the similarity between the two trees. Since computing a MAF of minimum size is an NP-hard problem, researchers have turned their attention to computing MAFs that approximate the minimum. Recently, it has been shown that the MAF on k(≥ 2) trees can be approximated to within a factor of 8. In this paper, we improve this ratio to 3. For certain species, however, the evolutionary history is not completely tree-like. Reticulation events, such as horizontal gene transfer (HGT), hybridization and recombination have played a significant role in the evolution of these species. Suppose we have two phylogenetic trees each of which is for a gene of the same set of species. Due to reticulate evolution the two gene trees, though related, appear different, making a phylogenetic network a more appropriate representation of reticulate evolution. A phylogenetic network contains hybrid nodes for the species evolved from two parents. The number of such nodes is its hybridization number. It has been shown that this number is 1 less than the size of a Maximum Acyclic Agreement Forest (MAAF). We show that the MAAF for k(≥ 2) phylogenetic trees can be approximated to within a factor of 3. ∗Research supported by an NSERC Grant to this author †Now a doctoral student at Concordia University, Montreal 1 ar X iv :1 40 7. 71 25 v1 [ cs .D S] 2 6 Ju l 2 01 4
منابع مشابه
Approximation Algorithms for Nonbinary Agreement Forests
Given two rooted phylogenetic trees on the same set of taxa X, the Maximum Agreement Forest problem (MAF) asks to find a forest that is, in a certain sense, common to both trees and has a minimum number of components. The Maximum Acyclic Agreement Forest problem (MAAF) has the additional restriction that the components of the forest cannot have conflicting ancestral relations in the input trees...
متن کاملComputing Hybridization Networks for Multiple Rooted Binary Phylogenetic Trees by Maximum Acyclic Agreement Forests
It is a known fact that given two rooted binary phylogenetic trees the concept of maximum acyclic agreement forests is sufficient to compute hybridization networks with minimum hybridization number. In this work we demonstrate by, first, presenting an algorithm and, second, showing its correctness that this concept is also sufficient in the case of multiple input trees. In detail, we show that ...
متن کاملFast computation of all maximum acyclic agreement forests for two rooted binary phylogenetic trees
Evolutionary scenarios displaying reticulation events are often represented by rooted phylogenetic networks. Due to biological reasons, those events occur very rarely, and, thus, networks containing a minimum number of such events, so-called minimum hybridization networks, are of particular interest for research. Moreover, to study reticulate evolution, biologist need not only a subset but all ...
متن کاملComputing a Relevant Set of Nonbinary Maximum Acyclic Agreement Forests
There exist several methods dealing with the reconstruction of rooted phylogenetic networks explaining different evolutionary histories given by rooted binary phylogenetic trees. In practice, however, due to insufficient information of the underlying data, phylogenetic trees are in general not completely resolved and, thus, those methods can often not be applied to biological data. In this work...
متن کاملAlgorithms for Maximum Agreement Forest of Multiple General Trees
The Maximum Agreement Forest (Maf) problem is a well-studied problem in evolutionary biology, which asks for a largest common subforest of a given collection of phylogenetic trees with identical leaf label-set. However, the previous work about the Maf problem are mainly on two binary phylogenetic trees or two general (i.e., binary and non-binary) phylogenetic trees. In this paper, we study the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1407.7125 شماره
صفحات -
تاریخ انتشار 2014